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Abstract 

Alon et al. (2004) [3] introduced the problem of learning a labelled graph 
from a given family of graphs by non-adaptive algorithms. In this paper, we 
present a generalization of this problem. Precisely, we consider a graph G and a 
subgraph H of G and we assume that G contains exactly one defective subgraph 
isomorphic to H . The goal is to find the defective subgraph by testing whether 
an induced subgraph contains an edge of the defective subgraph, with minimum 
number of tests. We give an upper bound for the number of tests to find the 
defective subgraph by using of the symmetric and high probability variation of 
Lovasz Local Lemma. 

Keywords: Group testing on graphs, Non-adaptive algorithm, Combinatorial 
search, Learning hidden subgraph. 

1 Introduction 

In the classic group testing problem that was first introduced by Dorfman [5] , there 
is a set of n items which contain at most d defective items. The goal of this problem 
is to find the defective items with minimum number of tests, which every test can 
contain some items and each test is positive if contain at least one defective item 
and otherwise, test is negative. There are two types of algorithms for group testing 
problem, sequential and non-adaptive. In the sequential algorithm, the outcome of 
previous tests can be used in the future tests and in non-adaptive algorithm all tests 
perform in the same time and by the result of all tests we should find defective items. 
Group testing has many applications including finding pattern in data, DNA library 
screening, and so on, for an overview of results and more applications, we refer the 
reader to [6, 7, 11]. 

Aigner [1] proposed the problem of group testing on graphs. In this problem, we 
are looking for one defective edge of the given graph G by performing the minimum 
sequential tests, where each test is an induced subgraph of the graph G and the test 
is positive if it contain the defective edge. 

In this paper, we consider the problem of non-adaptive group testing on graphs. 
Suppose there is a defective subgraph of G isomorphic to a graph H. Each test F is 
an induced subgraph of G and the outcome of test is positive if F has at least one 
edge in common with the defective subgraph. This is a generalization of the problem 
of learning hidden subgraph posed in [2]. More precisely, in learning hidden subgraph 
problem, the graph G is a complete graph, see [8] for some alternative formulation. 
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2 Notation 



Throughout this paper, we suppose that H is a subgraph of G with k edges and 
Hi, H2, ■ ■ ■ , H m are all subgraphs of G isomorphic to H. Moreover, we assume 
that G contains exactly one defective subgraph isomorphic to H. We denote the 
maximum degree of H by A = A(H). Also, G[X] denotes the subgraph of G induced 
by X n V(G) and for any vertex v € G, Njj(v) stands for the set of neighbours of 
the vertex v in the graph H. Hereafter, we assume that the subgraph H has no 
isolated vertices, because in the problem of group testing on graphs, vertices are not 
defective. 

A boolean matrix is said to be d- disjunct if for every column Cq and every choice 
of d columns C\, C2, . . . , C^ ( different from Co), there is at least one row such that 
the entry corresponding to Co is 1 and the entries corresponding to C\ , C2, ■ ■ ■ , Cd are 
all zeros. This concept was first introduced in [10]. To find the defective subgraph, 
by Theorem 2.1 of [9], it is sufficient to create a 1-disjunct matrix C such that 
Cij = 1 if the subgraph Hj and the test Fi have at least one edge in common, 
otherwise Cij = 0. 

3 Main result 

Suppose that T\, F2, ■ . . ,Tt C V(G) are t-independent tests such that for any F\, 
where 1 < I < t, a vertex v £ V(G) is chosen independently with probability p. For 
simplicity of notation we write Fi as an induced subgraph of G on vertices of Ti . 

Let C be a random txm matrix such that for any j and I, where 1 < j < m and 
1 < I < t, if E(Fi n Hj) 7^ 0, then cu = 1; otherwise, cy = 0. For any i,j, I, where 
1 < i 7^ j < m and 1 < I < t, define the event A\ j to be the set of all matrices C 
such that cu < cy. Also, define the event Aij to be the set of all matrices C such 
that for every I, 1 < I < i, we have Qj < cy. In other word, the event occurs 
if test Fi cannot distinguish between Hi and Hj. Also, the event Aij occurs if for 
every I such that 1 < I < t, test Fi cannot distinguish between Hi and Hj. 

In the main theorem, we show that the matrix C is a 1-disjunct matrix with 
positive probability. It means that for every % and j, there are two numbers 1 < I, I < 
t, such that E(Hi n F{) / 0, J5(fTj- D Fj) = 0 and E(Hi f\F v ) = 0, F(iJ, ClF v ) ^ 0. 
Consequently, by Theorem 2.1 of [9] , we can find the defective subgraph with at 
most t non- adaptive tests. 

In the main theorem we use Lovasz Local Lemma. In this regard, define the 
parameter rc(H), to find the maximum degree of the dependency graph for the 
events A itj . Define r G (H, Hi) = \{Hj : Hj is isomorphic to H, V(Hi) n V(Hj) / 0}\ 
and set ra(H) = max ra(H, Hi). 

i 

Theorem 1. Let for the defective subgraph H we have E(H) = k, A(H) = A. If 
one of the following conditions holds, 



1. k > 2A(A - 1) + 1 andy = 

2. k < 2A(A - 1) + 1 andy = 
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then we can find the defective subgraph H with t non-adaptive tests, where 



, , , . j r log(4er G (i?)) + logm 2logm ( 

( = 1 + mm { r ^5 ^i^ 1 )' 

To prove the main theorem, we need some supportive results. 

Lemma 1. Let H be a graph with n vertices, k edges, and maximum degree A. 
Pick, randomly and independently, each vertex of H with probability p, where p = 
Y^(l — f If F is the set of all chosen vertices, then H[F] is an independent 

set with probability at least 1 — e. 

To prove this lemma, we need high probability variation of Lovasz Local Lemma. 

Lemma A. [4] Let B%, Bi, ■ ■ ■ , Bk be events in a probability space. Suppose that 
each event Bi is independent of all the events Bj but at most d. For 1 < i < k and 

k 

0<e<l, ifPr(B t ) < f(l- l) d , then Pr(f|^) > 1 - e. 

i=i 

Proof of Lemma . . Let E(H) = {e\, e2, ■ • • , e^}. For 1 < i < k, we define Bi to 
be the event that et £ E(H[F]), so Pr(Bi) = p 2 . Since vertices are chosen randomly 
and independently, the event Bi is independent of the event Bj if and only if edges 
ei and ej have no common vertex. So the maximum degree of the dependency graph 

k 

is at most 2(A - 1). Since p 2 < f (l - f) 2(A_1) , by Lemma A , Pr( f] B^j > 1 - e. 

i=i 

Hence, H [F] is an independent set with probability at least 1 — e. ■ 

To find the probability that tests Fx, F2, . . . , Ft can distinguish between every 
pair of subgraphs Hi and Hj, we should know the construction of Hi and Hj and 
their intersection. So we just find the lower bound for this probability. In the next 
three lemmas we find three lower bounds for this probability in three special cases. 

Lemma 2. Suppose V(Hi) = V(Hj) and \E(Hi)\E(Hj)\ = 1. If one of the following 
conditions holds, 

1. fc>2A(A-l) + l ondp = vH k 1 - fc-2A(A-i)-i < " A ' 

2. k < 2A(A - 1) + 1 and p = */f (1 - e) A_1 



k 

then Pr(E{HiC\Fi) / 0,E(H j f]F l ) = 0) > p 2 {I - p) 2 ^-^ (I - e). 

Proof. Let e = {u, v} 6 E(Hi)\E(Hj). If HjdFi is an independent set of Hj and 
u, v £ Fi, then E(Hi n F[) ^ 0 and E(Hj n F/) = 0. Suppose that H' is an induced 

subgraph of Hj, where V(H') = V(Hj) \(uUvU N(u) U N(vj\ and \E(H')\ = k'. 

So k' > k - 2A(A - 1) - 1 and \V(H')\ > \V(H)\ - 2A. 

It is easily seen that u,v G T\ and Hj\F{\ is an independent set if and only if 
u,v £ T\, Nu, (u) n Ti = 0, Nffj (v) n Ti = 0, and H'[Fi] is an independent set. 
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One can see that Pr(u,v G P/) = p 2 . Since \N Hj (u)\JN Hj (v)\{u,v}\ < 2(A-1), 

the probability that P/ n \ N Hj (u) U N Hj (v) \ {u, v}j = 0 is at least (1 - p) 2A ~ 2 . 

/ \A-1 

1. Suppose k > 2A(A — 1) + 1 and p = y| ( 1 — fc_2A(A-i)-i ) ' ^ ne can 

check p 2 < p (l — p) 2A 2 , and by Lemma 1, the induced subgraph on p n V(H') 
is independent set with probability at least 1 — e; in other words, Pr(P(p n P') = 
0) > 1 — e. Hence, 

Pr(£?(ffinfi)^0,£7(^nF,) = 0) >p 2 (l-p) 2A - 2 (l-e). 

2. Now suppose fc < 2A(A - 1) + 1 and p = (1 - e) A_1 . If k' = 0, then 

Pr(E(Fi n H') = 0) = 1. Else if jf > 1, One can check p 2 < § (l - p) 2A ~ 2 , 
and by Lemma 1, the induced subgraph on p n V(H') is an independent set with 
probability at least 1 — e. In other words, Pr( y E{Fi n P') = 0) > 1 — e. Hence, 
Pr(E{H i C\F l )^0,E(H, J C\F l ) = 0) > p 2 (1 - p) 2A ~ 2 {1 - e) . ■ 

In the previous lemma two subgraphs P^ and Hj are very similar. By analysis of 
next two cases, we show that in this case the probability of distinguishing between 
Hi and Hj is minimum. 

Lemma 3. Suppose \V{Hi) \ V(Hj)\ = 1. // either 

k > 2A( A - 1) + 1 and p = J\ (l - fc _ 2A( i_ 1) _ 1 ) \ or 
k < 2A(A - 1) + 1 and p = J\ (1 - e) A "~\ then 



Pr (P(P, n Ft) + 0, E(Hj n F,) = 0) > p 2 (l - p) A (l - e). 

Proof. Since H has no isolated vertices, one can see that there exits at least one 
edge e = {u,v} G P(P;) \ E(Hj). Let v G V(-Hi) D F(Pj-) and u G V(fli) \ V(Hj). 
Suppose that H' is an induced subgraph of Hj, where V(H') = V(Hj) \(dU N(v)) 
and \E{H')\ = k' . So k' > k - A 2 and \V(H')\ > \V(H)\ - A - 1. 

Similar to the proof of Lemma 2, if u, v G T\ , Njjj (v) (~)Fi = 0 , and H ' [Fi] is an 
independent set, then P(Pj D F t ) ^ 0 and F(Pj n P) = 0. 

Since |iV^(t;)| < A, the probability that Nh^v) n P/ = 0 and u,v G Fi is at 
least p 2 (l — p) A . 

A-l 

, then 



If Jfe > 2A(A - 1) + 1 and p = (l 



fc-2A(A-l)-l 

(\ 2A— 2 2A— 2 

'-i^aO s f('-f) 

Hence, by Lemma : , Pr (P(p D P') = 0) > 1 — e, which implies 

Pr (E(Hi n Fi) + 0, P(Pj n P) = 0) > p 2 (l - p) A (l - e), 
Now suppose k < 2A(A - 1) + 1 and p = J~l (1 - e) A_1 . If fc' = 0, then 



Pr(E(Fi n P') = 0) = 1. Else if k' > 1, One can check p 2 < § (l - ^) 2A 2 , and 
by Lemma I, Pr (P(P n H') = 0) > 1 - e. 

Hence, Pr(^E(Hi n P) / 0, P(Pj D P) = 0) > p 2 (l - p) A (l - e). ■ 
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Lemma 4. If the induced subgraph on V(Hi) — V(Hj) has at least one edge, then 

Pr(E(Hi n Fi) + 0,E(Hj D F) = 0) > p 2 (l - e), 
where p=yfl (l - § ) 2A ~ 2 . 

Proof. Let e = (it, d) € E(Hi) \ E(Hj). If -£/j[F] is an independent set and 
u, v e F h then E{Hi nFj)/0 and E(Hj n F) = 0. Since Pr(u, d £ J]) = p 2 and 

p 2 < § (l — f) 2A 2 ; by Lemma 1, Pr (E(Hj n F) = 0) > 1 — e; and consequently, 
Pr(F(F, n F) / 0,F(i^- n F) = 0) > p 2 (l - e). ■ 

In the three previous lemmas, we find three lower bounds for the probability of 
distinguishing between Hi and Hj. By the use of next lemma we generalize these 
three special case to general case. The proof of the following lemma is straightfor- 
ward. 

Lemma 5. If E(H t )nE{Hj) = f 2 , . . . , f r } and E{H t )\E(Hj) = {ei, e 2 , . . . , e*_ r }, 
i/ien /or every 1 < q < k — r and 1 < I <t, 

Pr[E(Hi n F) / 0, F(#, n F) = 0) > Pr(e g £ F(F D fli), E(H 3 D F) = 0) . 

In the next theorem, we show that the probability of distinguishing between Hi 
and Hj has the minimum value, whenever V(Hi) = V(Hj) and \E(Hi) \E(Hj)\ = 1, 

Theorem 2. Ze£ |F(iF| = fc and A(H) = A. // either 

1. fc>2A(A-l) + 1, e =a L «nrfp=7f (l- F ^^ IFT ) A \ or 

2. fc < 2A(A - 1) + 1, e = 2s and p = (1 - e) A_1 , 
then for every 1 < i 7^ j < m and 1 < Z < t, we have 

Pr(A^) >p 2 (l-p) 2A ~ 2 (l-e). (1) 

Proof. LetF(^)nF(^) = f 2 , . . . , f r } and E(Hi)\E(Hj) = {e 1; e 2 , . . . , e fc _ r }. 

The event A| • occurs if E(Hi n F) 7^ 0 and F(ilj n F) = 0. By Lemma 5, for 
every 1 < q < k — r, 

Pr(E{Hi n F) / 0, E(Hj n F) = 0) > Pr (e g £ F(F D E{Hj n F) = 0) , 

so we need to consider the following three cases, 
case 1: V(Hi) = V(H 3 ), \E(Hi) \ E(Hj)\ = 1. 

By Lemma , it is clear Pr (^{jj — P 2 ( x ~~ p) 2 ^ A ^(l — e). 

case 2: \V(Hi) \ V(Hj)\ = 1. 

By Lemma 3, we have Pr(A\ j ) > p 2 (l -p) A (l - e) > p 2 (l -p) 2 ( A_1 )(l - e). 
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case 3: The induced subgraph on V(Hi) — V{Hj) has at least one edge. 

.2A-2 



One can check p 2 < | (l — |) , hence, by Lemma t, Pr y^ijj > P 2 ( i — e )- 
Consequently, Pr (^j) > P 2 (l - £>) 2A_2 (1 - e). 



So for every 1 < i / j < m and 1 < Z < t, Pr [A 1 ^ ) > p 2 (l - p) 2A ~ 2 (l - e 



To prove the main theorem, at first we should find an upper bound for the 
probability of occurring the bad event Aij for every 1 < i ^ j < m. 



Theorem 3. Let \E{H)\ = k and A(H) = A. // either 

1 / i \2A-1 / / \2A-2 

1. fc>2A(A-l) + l andy = ^{l-^ I ) h-J^A 



or 



2A-2 



2. k < 2A(A - 1) + 1 andy = ^ (l - ^) 2A ^ (l - ^ (l - ^) ' 
then for every 1 < i ^ j < m 

Pr(Aij) < (1 - yf. (2) 

Proof. Since Fi, F2, ■■ ■ ,Tt C V(G) are chosen randomly and independently, for 
every 1 t, A\ ■ and A\ ■ are two independent events. So 



Pr{A^) = (PriA^ 



Note that Pr \A\^ = Pr {E(Hi n F { ) / 0, F(Fj n F) = 0) . 

1. If k > 2A(A - 1) + 1, e = and p = J% (l - k _ 2 ^_ 1) _ 1 ) ^ then by 



Theorem 2, Fr j) > y'y"> where 



A / 1 x 2A-2 

1 



fc(A + l) 2 V (A + - 2A(A - 1) - 1) 

/ / x A-l^ 2A ~ 2 

1 / 1 x A 1 



k(A + 1) V (A + l)(fc - 2A(A - 1) - 1] 
one can easily seen that 

/ \2A-1 / / \ 2A-2 — — 

yy >y = hattt 1 - att 1 - J ma+d ■ Hence > *M4,-) > and 



fe(A+i) v A+i y V V fc ( A + 1 ) / ' ± ^ 11 ^' 1 ' y-^ij' 
consequently, Pr(Aij) < (1 — y) 1 . 

2. Now suppose k < 2A(A - 1) + 1, e = i, and p = (1 - e) A-1 . By 



Theorem 2, Fr f^ij) — y, where 



I , j n 2A-1 / / 7 / , \ A-l\ 2A ~ 2 



^^l 1 "^] I 1 " V 1 2A 



Hence, Pr(Aij) < (1 - y) 
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Now, we are ready to prove the main theorem. 
Proof of Theorem 1. If either 



1. k > 2A(A - 1) + 1 and y = ( 1 - ^ J ( 1 



2A-1 / / \ 2A-2 

fc(A+l) 



or 



2. k < 2A(A - 1) + 1 and y = ^ (l - ^) 2A ~ 1 (l - ^ (l - 

then by Theorem '.>, for every 1 < i ^ j < m, Pr(Aij) < (1 — y)'. Since the number 

of events A^j is m?, the probability that at least one of Aij's occurs is at most 

m 2 (l - yf. If t > A!^, then m 2 (l - y)* < 1. 
g i-a 

Now we prove that if t > lo s (4er G ) +iog m ^ by Lovasz Local Lemma, with 

& 1-2/ 

positive probability no event Aij occurs. Each event Aij is independent of A^jt if 
and only if [V(Hi) U V(Hjj) n (V (-#*') U V(i/,-/) ) = 0. So the maximum degree 



2A-2 



of the dependency graph is at most ^rG{H){m — 1). Hence, if t > log ( 4e? | G (#)) +iog m ^ 
then e(l-y)*(4r G (ff)(m-l) + l) < 1. So by Lovasz Local Lemma Pr ^ Q > 0. 



Therefore, if t = 1 + min i r log(4er G (g)Hjgg m i r 2logm -i i then there is a i x m 

{ l °Sl^y " ' lOg ' ' 

disjunct matrix. 
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